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TECHNICAL FIELD 

The present invention relates to computing and floating point arithmetic, and, more 
particularly, to systems and methods to implement an improved floating point adder. 

BACKGROUND 

The Institute of Electrical and Electronics Engineers (IEEE) specifies a standard, IEEE 
Std 754, for representation and conversion of exponential or floating point numbers. For 
% example, integer numbers can be converted to exponential numbers and binary numbers can be 
In used to represent different parts of the exponential numbers. In particular, an exponential, or 
1=6 floating point, number includes a sign, a significand or mantissa, and an exponent. The precision 

k ; U 

L of the floating point number indicates the number of bits available to represent the floating point 
'1 number; that is, the higher the precision, the more bits available. A single precision floating 
!* point number is represented by 32 bits: one bit for the sign bit, eight bits for the exponent, and 23 
H ~ bits for the mantissa. For norm numbers, a bit value of one is understood to precede the 23 bits 
of the mantissa, becoming in effect, an implicit one most significant bit. 

Floating point arithmetic is used for high-powered computing operations that require 
millions or billions or more of floating point operations per second (FLOPS). Basic functional 
building blocks such as floating point adders, multipliers and dividers are used to perform the 
arithmetic operations on floating point numbers. Numerous methods and systems of 
20 implementing binary adders in compliance with the IEEE 754 standard are well known in the art. 
A common general technique for adding two floating point numbers includes aligning and then 
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adding the mantissas of the floating point numbers to produce an arithmetic result for a mantissa. 
Arriving at the final result for the mantissa and for the calculation typically involves continued 
shifting and aligning, followed by a final rounding step, according to, for example, the round to 
nearest method of IEEE Std 754. 

The design of floating point arithmetic functions to manage high amounts of data 
throughput at ever increasing speeds and in ever reducing chip area presents challenges for 
designers of circuit architecture to contend with. Implementations of floating point arithmetic 
functional blocks, such as binary floating point adder implementations, need to become faster, 
more efficient, and take up less space on-chip. Accordingly, it would be desirable to provide 
alternative implementations and schemes that do not suffer from the drawbacks and weaknesses 
of existing implementations but rather that are faster, more efficient, that consume incrementally 
less space on chip. 

SUMMARY 

The presently preferred embodiments described herein include systems and methods for 
implementing an improved floating point adder that integrates adding and rounding. 

A method of adding a first floating point number to a second floating point number is 
provided according to one aspect of the invention. According to the method, a first mantissa, a 
second mantissa, and an input bit are added together to produce a third mantissa. The third 
mantissa is normalized to produce a final mantissa. The third mantissa and the final mantissa are 
correctly rounded as a result of the act of adding. 
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A method of adding a first floating point number to a second floating point number is 
provided according to a further aspect of the invention. According to the method, a first 
mantissa is added to a second mantissa. A fourth mantissa is added to an input bit to round the 
fourth mantissa. A third mantissa is normalized. The acts of adding are integrated together 
within a single adder to produce the third mantissa. 

A method of adding a first floating point number to a second floating point number is 
provided according to another aspect of the invention. According to the method, a first mantissa, 
i*, a second mantissa, and an input bit are added together to produce a third mantissa and an output 
2 bit. The third mantissa is rounded by updating the input bit based on the output bit to produce a 
ID fourth mantissa. The fourth mantissa is normalized. The acts of adding and rounding are 
H integrated within a single adder so that a separate adder to produce a correctly rounded result is 
L not needed and so that the acts of adding and rounding are performed prior to the act of 
ijt normalizing. 

S A method of adding a first floating point number to a second floating point number is 

provided according to a further aspect of the invention. According to the method, a first floating 
point number having a first mantissa and a second floating point number having a second 
mantissa are received. The first floating point number, when added to the second floating point 
number, produces a third floating point number having a third mantissa. The first mantissa and 
the second mantissa are respectively left-shifted as appropriate to obtain a fourth mantissa and a 

20 fifth mantissa. A first carry bit is produced from a second carry bit and from round control 

variables derived from the first mantissa and the second mantissa. The fourth mantissa, the fifth 
mantissa and the first carry bit are added together to produce a sixth mantissa and the second 
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cany bit. The sixth mantissa is correctly rounded. The sixth mantissa is right-shifted to produce 
the third mantissa. 

A floating point adder system to add a first floating point number to a second floating 
point number is provided according to another aspect of the invention. The system includes an 
adder and a shifter coupled to the adder. The adder performs an add operation to add a first 
mantissa, a second mantissa, and an input bit together to produce a third mantissa. The shifter 
normalizes the third mantissa to produce a final mantissa, The third mantissa and the final 

!□ mantissa are correctly rounded as a result of the add operation performed by the adder. 

s n 

ii 

W BRIEF DESCRIPTION OF THE DRAWINGS 

=1 The foregoing and other features, aspects, and advantages will become more apparent 

m from the following detailed description when read in conjunction with the following drawings, 
wherein: 

FIG. 1 is a diagram illustrating an exemplary floating point adder according to a first 
embodiment; 

FIG. 2 is a diagram illustrating an exemplary floating point adder according to a second 

presently preferred embodiment; 

FIGS. 3A and 3B are diagrams illustrating the unpack stages according to FIGS. 1 and 2; 

20 and 
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FIG. 4 is a diagram illustrating an exemplary floating point adder according to a third 
presently preferred embodiment. 



DETAILED DESCRIPTION OF THE PRESENTLY PREFERRED EMBODIMENTS 

The present invention will now be described in detail with reference to the accompanying 
drawings, which are provided as illustrative examples of preferred embodiments of the present 
,S invention. 

jfl The presently preferred embodiments described herein include systems and methods for 

M) implementing an improved floating point adder that integrates adding and rounding. Two 

c ; 'i 

mantissas are added together to produce a correctly rounded mantissa prior to normalization of 

]i the mantissa, making a follow on rounding stage unnecessary and conserving more space on 

in 

!J chip. With feedback of a carry output of the adder, a less significant bit of the correctly rounded 
mantissa is a function of a more significant bit of the mantissa. 

FIG. 1 is a diagram illustrating an exemplary floating point adder (FADD) 100 according 
to a first embodiment. The FADD 100 includes a swapper stage 102, a first unpack stage 104, a 
second unpack stage 106, a right shifter 108, a complement stage 112, an adder stage 114, a 
normalizer stage 116, a subtract stage 122, an increment adder stage 124, round control logic 
126, a rounding stage 128, adders 130, 132, and a left shifter 134. The normalizer stage 116 

20 includes a count up to e 3 leading zeros stage 120 and a left shifter 118. 
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FIG. 2 is a diagram illustrating an exemplary floating point adder (FADD) 100 according 
to a second presently preferred embodiment. The FADD 200 includes the swapper stage 102, 
the first unpack stage 104, the second unpack stage 106, a right shifter 208, a complement stage 
212, an adder stage 214, a normalizer stage 216, a subtract stage 222, an increment adder stage 
224, round control logic 226, and carry logic 228. The normalizer stage 216 includes a count up 
to e 3 leading zeros stage 220 and a left shifter 218. 

FIGS. 3A and 3B are diagrams illustrating the unpack stages 104, 106 according to FIGS. 
1 and 2. The second unpack stage 106 of FIG. 3A unpacks a floating point number ft into a sign 
bit s 0 , a 23 bit mantissa m 0 , and an 8 bit exponent field e 0 . If e 0 is equal to zero, the second 
unpack stage 106 supplements the mantissa m 0 with "00" to form a 25 bit mantissa m 2 and forms 
an 8 bit exponent field e 2 that is equal to one. If, as is far more commonly the case, e 0 is not 
equal to zero but is less than 255, the second unpack stage 106 supplements the mantissa m 0 with 
"01" to form a 25 bit mantissa m 2 and forms an 8 bit exponent field e 2 that is equal to e 0 . 

Similarly, the first unpack stage 104 of FIG. 3B unpacks a floating point number ft into a 
sign bit si, a 23 bit mantissa m h and an 8 bit exponent field ei. If ei is equal to zero, the first 
unpack stage 104 supplements the mantissa mi with "00" to form a 25 bit mantissa m 3 and forms 
an 8 bit exponent field e 3 that is equal to one. If, as is far more commonly the case, ei is not 
equal to zero but is less than 255, the second unpack stage 104 supplements the mantissa mi with 
"01" to form a 25 bit mantissa m 3 and forms an 8 bit exponent field e 3 that is equal to ei. 

FIG. 4 is a diagram illustrating an exemplary floating point adder (FADD) 300 according 
to a third presently preferred embodiment. The FADD 300 includes the swapper stage 102, a 
first right shifter 308, a second right shifter 304, inverters 306, 310, 330, a complement stage 
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312, an adder stage 314, a normalizer stage 316, a subtract stage 322, an increment adder stage 
324, round control logic 326, carry logic 328, and eight two input AND gates 332. The 
normalizer stage 316 includes a count up to ei leading zeros stage 320 and a left shifter 318. 

Operation of the First Embodiment 

Processing of the exemplary FADD 100 shown in FIG. 1 according to the first 
embodiment is now described. The swapper stage 102 receives two floating point numbers, ii 
and i 0 , and compares theunsigned magnitudes of the numbers to determine their relative size. 
The number whose magnitude is less than or equal to theunsigned magnitude of the other 
number is designated f 0 and is processed on the right-hand side of FIG. 1 . The greater or 
equivalent other number is designated fi and is processed on the left-hand side of FIG. 1 . The 
floating point number f 0 includes a sign bit s 0 (bit 31 of 32), an eight bit exponent field e 0 (bits 
30-23 of 32), and a mantissa or significand field (bits 22-0 of 32). The floating point number fi 
includes a sign bit si (bit 31 of 32), an eight bit exponent field ei (bits 30-23 of 32), and a 
mantissa or significand field (bits 22-0 of 32). 

The floating point numbers fi and f 0 are processed by the unpack stages 104, 106, 
respectively, as described above, so that on the left-hand side of FIG. 1, the sign bit si and the 
eight bit exponent field e 3 are removed from f h leaving a 25 bit adjusted mantissa m 3 , and on the 
right-hand side of FIG. 1, the sign bit s 0 and the eight bit exponent field e 2 are removed from fo, 
leaving a 25 bit adjusted mantissa m 2 . 

Processing of the remaining mantissa m 3 continues downward with the introduction of 
two control variables, K 3 and L 3 to the adder 130. L 3 and K 3 respectively represent the least 
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significant bit m 3 [0] and the second least significant bit m 3 [l] of the mantissa m 3 that serves as 
an input to the adder stage 114. 

A control variable for the FADD 100, ADD, is calculated by applying the Boolean 
equivalence function to the sign bits S] and s 0 , so that when Si and s 0 are both false or are both 
true, the floating point numbers f 0 and fj have the same sign, an addition operation is being 
performed, and ADD is true, that is, is equal to a logic one. Similarly, when si and s 0 are 
different valued, a subtraction operation is being performed, and ADD is false, that is, is equal to 
a logic zero. The inverter 110 is provided to complement the ADD control variable to produce 
the SUB control variable which is input to the complement stage 112 and the adder 132. 

Processing continues and the right shifter 108 receives and shifts the bits of the 25 bit 
mantissa m 2 to the right according to the value of the following control variable, RSHIFT, where 

RSHIFT = e 3 - e 2 . 

Accordingly, if RSHIFT is equal to zero, the mantissa m 2 is not shifted. The right shifter 108 
produces a 25 bit mantissa nu as well as a series of control variables K4, L 4 , G4, R4, and S4. 
TABLE I is a truth table for generation of K4, L 4 , G 4 , R4, and S 4 within the right shifter 108 
given the value of the control variable RSHIFT. For example, if RSHIFT is equal to 23, then K4 
is equal to the most significant bit m 2 [24] of the mantissa m 2 , that is, K4 is False. The control 
variables L 4 and K4 respectively represent the least significant bit nutO] and the second least 
significant bit m^l] of the mantissa nu. 



Table I 

Truth Table for Generation of J 4 , K4, L 4 , G 4 , R4 and S 4 within Right Shifters 108, 208 (FIGS. 1 
and 2) 

where 1 - OR; and m 2 |T| is the ith digit of m 2 . 
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RSHIFT 


U 


K4 


U 


G 4 


R4 


s 4 


0 


False 


m 2 fll 


m 2 [0] 


False 


False 


False 


1 


False 


m 2 r21 


m 2 [l] 


m 2 roi 


False 


False 


2 


False 


m 2 [3] 


m 2 r21 


m 2 [l] 


m 2 [0] 


False 


3 


False 


m 2 [41 


m 2 r31 


m 2 r21 


m 2 m 


m 2 roi 


4 


False 


m 2 [5] 


m 2 [41 


m 2 [31 


m 2 [2] 


(m 2 m i m 2 roi) 


5 


False 


m 2 [6] 


m 2 [5] 


m 2 [41 


m 2 [31 


Cm 2 T21|m 2 ril|m 2 [01) 


6 


False 


m 2 r71 


m 2 [6] 


m 2 [51 


m 2 [41 


(m 2 [3H m 2 [0]) 


7 


False 


m 2 [8] 


m 2 r71 


m 2 [6] 


m 2 [51 


(m 2 [4H m 2 [0]) 


8 


False 


m 2 r91 


m 2 [8] 


m 2 r7] 


m 2 [6] 


(m 2 r51 1 | m 2 [0]) 


9 


False 


m 2 [101 


m 2 [9] 


m 2 [81 


m 2 [71 


(m 2 [6H m 2 [0]) 


10 


False 


m 2 [lll 


m 2 [10] 


m 2 r91 


m 2 [81 


(m 2 [7] 


i 


m 2 [0]) 


11 


False 


m 2 [12] 


m 2 [lll 


m 2 [10] 


m 2 [9] 


(m 2 [8] 




m 2 roi) 


12 


False 


m 2 [13] 


m 2 [12] 


m 2 [in 


m 2 [101 


(m 2 [9] 




tn 2 [0]) 


13 


False 


m 2 [14] 


m 2 [131 


m 2 [12] 


m 2 ril] 


(m 2 [10] 




m 2 [01) 


14 


False 


m 2 [15] 


m 2 [14] 


m 2 [13] 


m 2 [121 


(m 2 rmi 


m 2 r0]) 


15 


False 


m 2 ri6] 


m 2 [151 


m 2 ri41 


m 2 n3] 


(m 2 ri21 1 m 2 [01) 


16 


False 


m 2 [171 


m 2 [16] 


m 2 ri51 


m 2 [14] 


(m 2 [13] 


m 2 [0]) 


17 


False 


m 2 [18] 


m 2 [17] 


m 2 [16] 


m 2 ri5] 


(m 2 ri4H |m 2 [0]) 


18 


False 


m 2 [191 


m 2 [18] 


m 2 [17] 


m 2 [16] 


(m 2 ri51 |m 2 [0]) 


19 


False 


m 2 [20] 


m 2 [191 


m 2 [18] 


m 2 [17] 


fm 2 ri61 |m 2 [01) 


20 


False 


m 2 r21] 


m 2 [20] 


m 2 [19] 


m 2 [18] 


(m 2 [17] 


1 m 2 [0]) 


21 


False 


m 2 [22] 


m 2 [21] 


m 2 [20] 


m 2 [19] 


(m 2 [18] 


m 2 [0]) 


22 


False 


m 2 r23] 


m 2 [22] 


m 2 [21] 


m 2 [201 


(m 2 [19] 




m 2 [0]) 


23 


False 


False 


m 2 r231 


m 2 [22] 


m 2 [21] 


(m 2 [201 


m 2 [0]) 


24 


False 


False 


False 


m 2 [23] 


m 2 [22] 


(m 2 r211 


m 2 r0]) 


25 


False 


False 


False 


False 


m 2 [23] 


(m 2 r221 1 


m 2 [0]) 


26 


False 


False 


False 


False 


False 


(m 2 r231l m 2 [0]) 


< 27 


False 


False 


False 


False 


False 


(m 2 [231 m 2 [01) 



At the complement stage 112, if SUB is False, then ADD is True and an addition is being 
performed and none of the inputs to the complement stage 112 are complemented. If, however, 
SUB is True, each binary digit of the mantissa nu as well as each of the control variables K4, L 4 , 
G 4 , R4, and S 4 are complemented by the complement stage 112. The operation of the 
complement stage 112 is summarized as follows: 
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m 6 = {m 6 [24], me[23], . . m 6 [l], m 6 [0]} = {m4[24] A SUB, m4[23] A SUB, . . ni4[2] A SUB, 

rri4[l] A SUB}, 

K 6 = K4 A SUB = m4[l] A SUB = m 6 [l], 
U = U A SUB = m4[0] A SUB = m 6 [0], 
G 6 = G 4 A SUB, 
R 6 = R4 A SUB, and 

S6 = S 4 A SUB where A represents the Boolean exclusive-or operation. 
The 25 bit mantissa m 6 produced by the complement stage 112 serves as another input to the 
adder stage 114. The control variables K 6 and L 6 are input to the adder 130 and the control 
variables G 6 , R6, and S 6 are input the adder 132. 

TABLE II is a truth table for generation of Cin, Gg, R 8 , and S 8 within the adder 132 given 
the values of the control variable SUB and the input control variables G 6 , Re, and S 6 . For 
example, if SUB is True, and G 6 , Re, and S 6 are all True, then G 8 , Rg, and S 8 are all False and the 
overflow bit Cin is True. The signal Cin serves as a carry input to the adder stage 114 as well as 
the adder 130. 



Table II 

Truth Table for Generation of Cin, G 8 , R 8 , and S 8 within Adder 132 (FIG. 1) 
Cin = SUB&G 6 &R6&S 6 

where & = AND; and ~ = NOT 



SUB = -ADD 
(Adder 132 Carry In 
Bit) 


G6 R6 S6 

(Adder 132 Input) 


Gg Rg Sg 

(Adder 132 Sum) 


Cin 

(Adder 132 Carry Out 
Bit) 


False 


False False False 


False False False 


False 


False 


False False True 


False False True 


False 


False 


False True False 


False True False 


False 


False 


False True True 


False True True 


False 


False 


True False False 


True False False 


False 
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Table II 

Truth Table for Generation of Cin, G 8 , Rs, and S 8 within Adder 132 (FIG. 1) 
Cin=SUB&G 6 &R 6 &S 6 



where & = AND; and ~ 


= NOT 






SUB = -ADD 


G6 R6 S6 

\j \J \J 


G 8 R 8 S 8 


Cin 


(Adder 132 Carry In 


(Adder 132 Input) 


(Adder 132 Sum) 


(Adder 132 Carry Out 


Bit) 






Bit) 


False 


True False True 


True False True 


False 


False 


True True False 


True True False 


False 


False 


True True True 


True True True 


False 


True 


False False False 


False False True 


False 


True 


False False True 


False True False 


False 


True 


False True False 


False True True 


False 


True 


False True True 


True False False 


False 


True 


True False False 


True False True 


False 


True 


True False True 


True True False 


False 


True 


True True False 


True True True 


False 


True 


True True True 


False False False 


True 



TABLE III is a truth table for generation of K 8 and L 8 within the adder 130 given the 
values of Cin from the adder 132 and of the pairs of input control variables K 3 , L 3 and K*, L 6 . 
For example, if Cin is False, and K 3 > L 3 and K$, L 6 are all True, then K 8 and L 8 are True and 
False, respectively. The control variables K 8 and L 8 join with G 8 , R 8 , and S 8 to form a 5 bit word 
that is input to the left shifter 134. 



Table III 

Truth Table for Generation of Kg and L 8 within Adder 130 (FIG. 1) 


Cin 

(Adder 130 Carry In 
Bit) 


K3 L3 

(Adder 130 First 
Input) 


Kg L6 

(Adder 130 Second 
Input) 


Kg Lg 

(Adder 130 Sum 
Input) 


0 


00 


00 


00 


0 


00 


01 


01 


0 


00 


1 0 


1 0 


0 


00 


1 1 


1 1 


0 


01 


00 


01 


0 


01 


0 1 


1 0 
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Table III 

Truth Table for Generation of K 8 and L 8 within Adder 130 (FIG. 1) 


Cin 

(Adder 130 Carry In 
Bit) 


K 3 L 3 

(Adder 130 First 
Input) 

r i _ 


K$ L6 

(Adder 130 Second 
Input) 


K 8 L 8 

(Adder 130 Sum 
Input) 


0 


0 1 


1 0 


1 1 


0 


0 1 


1 1 


00 


o 


1 0 


00 


1 0 


o 


1 0 


0 1 


1 1 


o 


1 0 


1 0 


00 


o 


1 0 


1 1 


01 


o 


1 1 


00 


1 1 


o 


1 1 


0 1 


00 


o 


1 1 


1 0 


0 1 


o 


1 1 


1 1 


1 0 




00 


00 


01 




00 


01 


1 0 


1 


00 


1 0 


1 1 




0 0 


1 1 


00 


1 


0 1 


00 


1 0 




0 1 


0 1 


1 1 




0 1 


1 0 


00 




01 


1 1 


01 




1 0 


00 


1 1 




1 0 


01 


00 




1 0 


1 0 


01 




1 0 


1 1 


1 0 




1 1 


00 


00 




1 1 


01 


01 




1 1 


1 0 


1 0 




1 1 


1 1 


1 1 



The adder stage 114 receives the 25 bit mantissa m 3 , the 25 bit mantissa m 6 , and the Cin 
bit signal and adds these together to produce a 25 bit mantissa m 7 . The carry out bit of the adder 
stage 114 is discarded and is not used to obtain the final result of the FADD 100. 

Processing continues to the normalizer stage 116, which receives the 25 bit mantissa m 7 
and the control variable G 8 from the adder 132. The count up to e 3 leading zeros stage 120 
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examines each bit of the mantissa m 7 beginning with the most significant bit m 7 [24] and counts 
logic zeros until the stage 120 counts e 3 logic zeros, until the stage 120 encounters a logic one, or 
until the 25 bits of the mantissa m 7 are exhausted, whichever comes first. The stage 120 outputs 
a control variable LSHIFT that is equal to the number of counted leading logic zeros in the 
mantissa m 7 . The left shifter 118 receives and shifts the bits of the 25 bit mantissa m 7 to the left 
to produce a 25 bit mantissa mn according to the value of the control variable, LSHIFT. 
Accordingly, if LSHIFT is equal to zero, the mantissa m 7 is not shifted and the mantissa mn 
equals the mantissa m 7 . If LSHIFT is equal to one, then the control variable G 8 is shifted in to 
become the least significant bit of the mantissa m n . If LSHIFT is greater than one, the control 
variable G 8 is shifted in, followed by LSHIFT- 1 logic zeros, to form the latter part of the 
mantissa mio- 

Returning to the left shifter 134, the shifter 134 receives the 5 bit word consisting of the 
control variables K 8 , L 8 , G 8 , R 8 , and S 8 and shifts the bits of the 5 bit word to the left to produce 
a 5 bit word that consists of the control variables K 10 , Li 0 , Gio, Rio, and Sio, according to the 
value of the control variable, LSHIFT. TABLE IV is a truth table for generation of K ]0 , Li 0 , Gi 0 , 
R 10 ,and Si 0 within the left shifter 134 given the values of the control variable LSHIFT and the 
input control variables K 8 , L 8 , G 8 , R 8 ,and S 8 . For example, if LSHIFT is equal to three, then Kj 0 
is equal to R 8 , Li 0 is equal to S 8 , and Gi 0 , Rio, and Si 0 are all False. The 5 bit word of Ki 0 L 10 
do Rio Sio serves as the input to the round control logic 126. 



Table IV 

Truth Table for Generation of Ki 0 , Li 0 , Gio, Rio and Sio within Left Shifter 134 (FIG. 1) 
where I = OR 
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LEFT SHIFT 
(integer value of 8 
bit number) 




Lio 


Gio 


Rio 


Sio 


0 


Kg 


L 8 


G 8 


R 8 


s 8 


1 


Lg 


G 8 


R 8 


s 8 


False 


2 


G 8 


Rs 


s 8 


False 


False 


3 


Rg 


s 8 


False 


False 


False 


4 


Sg 


False 


False 


False 


False 


5 to e3 


False 


False 


False 


False 


False 



Next, the round control logic 126 receives the control variables Kio, Lio, Gio, Rio, and 
Sio. TABLE V is a truth table for generation of the control signal increment (INC) within the 
round control logic 126 given the values of the input control variables Kio, Lio, Gio, Rio,and Sio. 
For example, if K !0 and Sio are False and Lio, Gio, and R 10 are True, then INC is True. The 
signal INC serves as a carry input to the rounding stage 128. 



Table V 

Truth Table for Generation of INC within Round Control Logic 126 (FIG. 1) 


Kio 


Lio 


Gio 


Rio 


Sio 


INC 


False 


False 


False 


False 


False 


False 


False 


False 


False 


False 


True 


False 


False 


False 


False 


True 


False 


False 


False 


False 


False 


True 


True 


False 


False 


False 


True 


False 


False 


False 


False 


False 


True 


False 


True 


False 


False 


False 


True 


True 


False 


False 


False 


False 


True 


True 


True 


False 


False 


True 


False 


False 


False 


True 


False 


True 


False 


False 


True 


True 


False 


True 


False 


True 


False 


True 


False 


True 


False 


True 


True 


True 


False 


True 


True 


False 


False 


True 


False 


True 


True 


False 


True 


True 


False 


True 


True 


True 


False 


True 


False 


True 


True 


True 


True 


True 


True 


False 


False 


False 


False 


False 
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Table V 



Truth Table for Generation of INC within Round Control Logic 126 (FIG. 1) 





Lin 


^ 1 U 




^10 

JU 


INC 


Tvil A 

1 I Uv 


Palsp 

_L alow 


False 


False 


True 


False 


True 
i rue 


X CHOW 


False 


True 


False 


False 


i rue 


Th q 1 CA 

r disc; 




True 


True 


False 


i rue 


r disc 


True 


Fal se 


False 


False 


i rue 


J7 dloC 


True 


False 


True 


False 


True 


J/aloC 


X I UC 


True 


False 


False 


True 


False 


True 


True 


True 


True 


True 


True 


False 


False 


False 


True 


True 


True 


False 


False 


True 


True 


True 


True 


False 


True 


False 


True 


True 


True 


False 


True 


True 


True 


True 


True 


True 


False 


False 


True 


True 


True 


True 


False 


True 


True 


True 


True 


True 


True 


False 


True 


True 


True 


True 


True 


True 


True 



Processing of the mantissa m n continues downward with the removal of the least 
significant bit mi i [0] from the mantissa mi i to produce a 24 bit mantissa mi 2 . The rounding 
stage 128 adds the signal INC to the mantissa mi 2 to produce a 24 bit mantissa mi 3 . The 
rounding stage 128 is in effect an additional adder stage that follows the normalizer stage 116. 

Next, the most significant bit of the 24 bit mantissa mo is removed to produce the 23 bit 
mantissa m of the floating point number f reS uit output of the FADD 100. The most significant bit 
serves as an input to the increment adder stage 124. 

The subtract stage 122 subtracts the 8 bit control variable LSHIFT from the 8 bit 
exponent field value e 3 from the unpack stage 104 and produces an 8 bit output that is in turn 
sent to the increment adder stage 124. The increment adder stage 124 adds the 8 bit output of the 
subtract stage 122 to the one bit most significant bit described above to produce the 8 bit 
exponent field e of the floating point number f^uit- The sign bit si of the floating point number 
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ft becomes the sign bit s of the floating point number f resu it. All parts s, e, and m of the floating 
point number f resu it are thus known and processing of the FADD 100 terminates. 

Operation of the Second Embodiment 

Processing of the exemplary FADD 200 shown in FIG. 2 according to the second 
embodiment is now described. The swapper stage 102 receives two floating point numbers, ii 
and i 0 , and compares theunsigned magnitude of the numbers to determine their relative size. The 
number whose magnitude is less than or equal to theunsigned magnitude of the other number is 
designated f 0 and is processed on the right-hand side of FIG. 2. The greater or equivalent other 
number is designated f\ and is processed on the left-hand side of FIG. 2. The floating point 
number f 0 includes a sign bit s 0 (bit 3 1 of 32), an eight bit exponent field e 0 (bits 30-23 of 32), 
and a mantissa or significand field (bits 22-0 of 32). The floating point number fi includes a sign 
bit S] (bit 3 1 of 32), an eight bit exponent field ei (bits 30-23 of 32), and a mantissa or 
significand field (bits 22-0 of 32). 

The floating point numbers fi and f 0 are processed by the unpack stages 104, 106, 
respectively, as described above, so that on the left-hand side of FIG. 2, the sign bit Si and the 
eight bit exponent field e 3 are removed from fi, leaving a 25 bit adjusted mantissa m 3 , and on the 
right-hand side of FIG. 2, the sign bit s 0 and the eight bit exponent field e 2 are removed from f 0 , 
leaving a 25 bit adjusted mantissa m 2 . 

Processing of the remaining mantissa m 3 continues downward with the introduction of 
three control variables, J 3 , K 3 , and L 3 to the round control logic 226. J 3 , L 3 , and K 3 respectively 
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represent the most significant bit m 3 [24], the least significant bit m 3 [0], and the second least 
significant bit m 3 [l] of the mantissa m 3 that serves as an input to the adder stage 214. 

A control variable for the FADD 200, ADD, is calculated by applying the Boolean 
equivalence function to the sign bits si and s 0 , so that when si and s 0 are both false or are both 
true, the floating point numbers f 0 and fj have the same sign, an addition operation is being 
performed, and ADD is true, that is, is equal to a logic one. Similarly, when si and s 0 are 
different valued, a subtraction operation is being performed, and ADD is false, that is, is equal to 
a logic zero. The inverter 210 is provided to complement the ADD control variable to produce 
the SUB control variable which is input to the complement stage 212 and the round control logic 
226. 

Processing continues and the right shifter 208 receives and shifts the bits of the 25 bit 
mantissa m 2 to the right according to the value of the following control variable, RSHIFT, where 
RSHIFT = e 3 - e 2 . 

Accordingly, if RSHIFT is equal to zero, the mantissa m 2 is not shifted. The right shifter 208 
produces a 25 bit mantissa ni4 as well as a series of control variables J 4 , K4, L 4 , G 4 , R4, and S 4 . 
TABLE I is a truth table for generation of J 4 , Jm, L 4 , G 4 , R 4 , and S 4 within the right shifter 208 
given the value of the control variable RSHIFT. For example, if RSHIFT is equal to 23, then K4 
is equal to the most significant bit m 2 [24] of the mantissa m 2 , that is, ¥U is False. The control 
variables J 4 , L 4 , and K4 respectively represent the most significant bit m4[24], the least 
significant bit rn^O], and the second least significant bit m^l] of the mantissa nu. 

At the complement stage 212, if SUB is False, then ADD is True and an addition is being 
performed and none of the inputs to the complement stage 212 are complemented. If, however, 
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SUB is True, each binary digit of the mantissa 1114 as well as each of the control variables J4, K4, 
L4, G4, R4, and S4 are complemented by the complement stage 212. The operation of the 
complement stage 212 is summarized as follows: 

m 6 = {m 6 [24], m 6 [23], .... m 6 [l], m 6 [0]} = {m4[24] A SUB, m4[23] A SUB, m4[2] A SUB, 
m4[l] A SUB}, 

J 6 = J 4 A SUB = m4[24] A SUB = m 6 [24], 

K 6 = K4 A SUB = nuti] A SUB = m 6 [l], 

U = U A SUB = m4[0] A SUB - m 6 [0], 

G 6 = G 4 A SUB, 

R 6 = R4 A SUB, and 

S6 = S4 A SUB where A represents the Boolean exclusive-or operation. 
The 25 bit mantissa m6 produced by the complement stage 212 serves as another input to the 
adder stage 214. The control variable outputs J 6 , IQ, L 6 , G 6 , R6, and S 6 of the complement stage 
212 are input to the round control logic 226. 

TABLE VI is a truth table for generation of internal control variables L and K within the 
round control logic 226 given the values of the input control variables K 3 , K$, L 3 , and L6. For 
example, if K 3 , K<s, L 3 , and L 6 are all true, then L and K are False and True, respectively. 



Table VI 

Truth Table for Generation of L and K within Round Control Logic 226 (FIG. 2); K = K 3 A K« 
(L 3 & L 6 ); L = L 3 A U 

where A = EXCLUSIVE-OR; and & = AND 



K 3 


K6 


U 


U 


L 


K 


False 


False 


False 


False 


False 


False 


False 


False 


False 


True 


True 


False 


False 


False 


True 


False 


True 


False 
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Table VI 

Truth Table for Generation of L and K within Round Control Logic 226 (FIG. 2); K = K 3 A A 
(L 3 & L 6 ); L = L 3 A U 

where A = EXCLUSIVE-OR; and & = AND 






T 


T , 


T 


K 






True 


True 


False 


True 


False 


True 


False 


False 


False 


True 


False 


True 


False 


True 


True 


True 


False 


True 


True 


False 


True 


True 


False 


True 


True 


True 


False 


False 


True 


False 


False 


False 


False 


True 


True 


False 


False 


True 


True 


True 


True 


False 


True 


False 


True 


True 


True 


False 


True 


True 


False 


False 


True 


True 


False 


False 


False 


False 


True 


True 


False 


True 


True 


False 


True 


True 


True 


False 


True 


False 


True 


True 


True 


True 


False 


True 



TABLE VII is a truth table for generation of the control variables GEN, PROP and G 8 
H within the round control logic 226 given the values of the internal control variables K and L 
t shown in TABLE VI and of the input control variables SUB, J 3 , J 6 , G 6 , R 6 , and S 6 . For example, 
* if J 3 , h, SUB, K, L, G 6 , R 6 , and S 6 are all True, then GEN and PROP are both True and G 8 is 
False. The control variables GEN and PROP are input to the carry logic 228. G 8 serves as an 
input to the left shifter 218 of the normalizer stage 216. TABLE VII also explains the generation 
of the signal Cin within the carry logic 228 given the values of the control variables GEN and 
PROP from the round control logic 226. Similarly, TABLE VIII is a truth table for generation of 
10 the signal Cin within the carry logic 228 given the values of the control variables GEN and 

PROP and the signal C23 from the adder stage 214. C23 is the second most significant carry bit 
of the adder stage 214, with C24 or Cout representing the most significant carry bit of the adder 
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stage 214. For example, if GEN is False and PROP is True, then Cin follows the value of C23. 
If GEN and PROP are both False, then Cin is False. Finally, if GEN is True, then Cin is True. 



Table VII 

Truth Table for Generation of GEN, PROP, and G 8 within Round Control Logic 226 and Cin 
within Carry Logic 228 (FIG. 2); 

GEN = (-SUB & ~G 6 & (L | ~R 6 1 ~S 6 )) | (SUB & G 6 & (R 6 1 S 6 )) | (~J 6 & -SUB & K & L); 
PROP = (~J 3 & -SUB & L & (K | ~R 6 |~S 6 )) | (L & ~G 6 & Re & S 6 ) | (SUB & 
G 6 ); G 8 = (~G 6 & Re) | (G 6 & -Re & ~S 6 ) 

where & = AND; | = OR; and ~ = NOT; DC = Don't Care 



J 3 
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SUB 
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G 6 


R 6 
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GEN 


PROP 
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DC 
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Table VII 

Truth Table for Generation of GEN, PROP, and G 8 within Round Control Logic 226 and Cin 
within Carry Logic 228 (FIG. 2); 

GEN = (-SUB & ~G 6 & (L | -Re | ~S 6 )) I (SUB & G 6 & (R 6 1 S 6 )) | He & -SUB & K & L); 
PROP = (~J 3 & -SUB & L & (K | ~R 6 |~S 6 )) | (L & ~G 6 & R 6 & S 6 ) | (SUB & 
G 6 ); G 8 = (~G 6 & Re) | (G 6 & -Re & ~S 6 ) 

where & = AND; | = OR; and - = NOT; DC = Don't Care 
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Table VII 

Truth Table for Generation of GEN, PROP, and G 8 within Round Control Logic 226 and Cin 
within Carry Logic 228 (FIG. 2); 

GEN = (-SUB & ~G 6 & (L | -Re | ~S 6 )) | (SUB & G 6 & (R 6 1 S 6 )) I He & -SUB & K & L); 
PROP = (~J 3 & -SUB & L & (K | ~R 6 |~S 6 )) I (L & ~G 6 & R 6 & S 6 ) I (SUB & 
G 6 ); G 8 = (~G 6 & Re) | (G 6 & -Re & ~S 6 ) 

where & = AND; 1 = OR; and ~ = NOT; DC = Don't Care 
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Table VII 

Truth Table for Generation of GEN, PROP, and G 8 within Round Control Logic 226 and Cin 
within Carry Logic 228 (FIG. 2); 

GEN = (-SUB & ~G 6 & (L | ~R 6 1 ~S 6 )) I (SUB & G 6 & (R 6 1 S 6 )) I He & -SUB & K & L); 
PROP - (~J 3 & -SUB & L & (K | ~R 6 |~S 6 )) | (L & ~G 6 & Re & S 6 ) | (SUB & 
G 6 ); G 8 = (~G 6 & IU) | (G 6 & ~R 6 & ~S 6 ) 
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Table VII 

Truth Table for Generation of GEN, PROP, and G 8 within Round Control Logic 226 and Cin 
within Carry Logic 228 (FIG. 2); 

GEN = (-SUB & ~G 6 & (L | -Re | ~S 6 )) | (SUB & G 6 & (R 5 1 S 6 )) | He & -SUB & K & L); 

PROP = (~J 3 & -SUB & L & (K | ~R 6 |~S 6 )) | (L & ~G 6 & Re & S 6 ) | (SUB & 

G 6 ); G 8 = (~G 6 & Re) I (G 6 & ~R 6 & ~S 6 ) 

where & = AND; | = OR; and ~ = NOT; DC = Don't Care 
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Table VII 

Truth Table for Generation of GEN, PROP, and G 8 within Round Control Logic 226 and Cin 
within Carry Logic 228 (FIG. 2); 

GEN = (-SUB & ~G 6 & (L | ~R 6 1 ~S 6 )) | (SUB & G 6 & (R 6 1 S 6 )) | He & ~SUB & K & L); 
PROP = (~J 3 & -SUB & L & (K | ~R 6 |~S 6 )) | (L & ~G 6 & R 6 & S 6 ) | (SUB & 
G 6 ); G 8 = (~G 6 & Re) | (G 6 & -Re & ~S 6 ) 
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Table VII 

Truth Table for Generation of GEN, PROP, and G 8 within Round Control Logic 226 and Cin 
within Carry Logic 228 (FIG. 2); 

GEN = (-SUB & ~G 6 & (L | ~R 6 1 ~S 6 )) | (SUB & G 6 & (R 6 1 S 6 )) I (~J 6 & -SUB & K & L); 
PROP = (~J 3 & -SUB & L & (K | -Re |~S 6 )) | (L & ~G 6 & R 6 & S 6 ) | (SUB & 
G 6 ); G 8 = (~G 6 & Re) | (G 6 & ~R 6 & ~S 6 ) 

where & = AND; 1 = OR; and ~ = NOT; DC = Don't Care 
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Table VII 

Truth Table for Generation of GEN, PROP, and G 8 within Round Control Logic 226 and Cin 
within Carry Logic 228 (FIG. 2); 

GEN = (-SUB & ~G 6 & (L | -Re | ~S 6 )) | (SUB & G 6 & (R 6 1 S 6 )) I He & -SUB & K & L); 
PROP = (~J 3 & -SUB & L & (K | ~R 6 |~S 6 )) | (L & ~G 6 & R 6 & S 6 ) | (SUB & 
G 6 ); G 8 = (~G 6 & Re) | (G 6 & ~R 6 & ~S 6 ) 

where & = AND; [ = OR; and - = NOT; DC = Don't Care 
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Table VIII 








Truth Table for Generation of Cin within Carry Logic 228 (FIG. 2); 




Cin - GEN | (PROP & C23) 






where & - AND; 1 = OR 
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Table VIII 








Truth Table for Generation of Cin within Carry Logic 228 (FIG. 2); 




Cin = GEN | (PROP & C23) 






where & = AND; I = OR 
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The adder stage 214 receives the 25 bit mantissa m 3 , the 25 bit mantissa m 6 , and the Cin 
bit signal and adds these together to produce a 25 bit mantissa m 7 . The second most significant 
carry bit C23 of the adder stage 214 is fed back to the carry logic 228 as described above. Thus, 
the input carry bit Cin to the adder stage 214 is a function of the carry bit C23 of the adder stage 
214. 

Processing continues to the normalizer stage 216, which receives the 25 bit mantissa m 7 
and the control variable G 8 from the round control logic 226. The count up to e 3 leading zeros 
stage 220 examines each bit of the mantissa m 7 beginning with the most significant bit m 7 [24] 
and counts logic zeros until the stage 220 counts e 3 logic zeros, until the stage 220 encounters a 
logic one, or until the 25 bits of the mantissa m 7 are exhausted, whichever comes first. The stage 
220 outputs a control variable LSHIFT that is equal to the number of counted leading logic zeros 
in the mantissa m 7 . The left shifter 218 receives and shifts the bits of the 25 bit mantissa m 7 to 
the left to produce a 25 bit mantissa mio according to the value of the control variable, LSHIFT. 
Accordingly, if LSHIFT is equal to zero, the mantissa m 7 is not shifted and the mantissa mio 
equals the mantissa m 7 . If LSHIFT is equal to one, then the control variable G 8 is shifted in to 
become the least significant bit of the mantissa mi 0 . If LSHIFT is greater than one, the control 
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variable G 8 is shifted in, followed by LSHIFT-1 logic zeros, to form the latter part of the 
mantissa mio. 

Processing of the mantissa mjo continues downward with the removal of the least 
significant bit mi 0 [0] from the mantissa mi 0 . Next, the most significant bit of the resulting 24 bit 
mantissa is removed to produce the 23 bit mantissa m of the floating point number f resu it output of 
the FADD 200. The most significant bit serves as an input to the increment adder stage 224. 

The subtract stage 222 subtracts the 8 bit control variable LSHIFT from the 8 bit 
exponent field value e 3 from the unpack stage 104 and produces an 8 bit output that is in turn 
sent to the increment adder stage 224. The increment adder stage 224 adds the 8 bit output of the 
subtract stage 222 to the one bit most significant bit described above to produce the 8 bit 
exponent field e of the floating point number f resu i t . The sign bit si of the floating point number 
ft becomes the sign bit s of the floating point number f reS uit- AH parts s, e, and m of the floating 
point number f resu h are thus known and processing of the FADD 200 terminates. 

Operation of the Third Embodiment 

Processing of the exemplary FADD 300 shown in FIG. 4 according to the third 
embodiment is now described. The swapper stage 102 receives two floating point numbers, ii 
and i 0? and compares theunsigned magnitude of the numbers to determine their relative size. 
Theunsigned number that is less than or equal to the otherunsigned number is designated f 0 and 
is processed on the right-hand side of FIG. 4. The greater or equivalent otherunsigned number is 
designated ft and is processed on the left-hand side of FIG. 4. The floating point number f 0 
includes a sign bit s 0 (bit 3 1 of 32), an eight bit exponent field e 0 (bits 30-23 of 32), and a 
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mantissa or significand field (bits 22-0 of 32). The floating point number ft includes a sign bit si 
(bit 31 of 32), an eight bit exponent field ei (bits 30-23 of 32), and a mantissa or significand field 
(bits 22-0 of 32). 

On the left-hand side of FIG. 4, the sign bit si and the eight bit exponent field d are 
removed from ft. Processing of the remaining mantissa continues downward with the 
introduction of a guard bit Gi having a value of logic zero and an implicit one bit, producing a 25 
bit mantissa mi. 

On the right-hand side of FIG. 4, the sign bit s 0 and the eight bit exponent field e 0 are 
removed from ft. Processing of the remaining mantissa continues downward with the 
introduction of an implicit one bit, producing a 24 bit mantissa m 0 . The implicit one bit is the 
most significant bit mo[23] of the mantissa m 0 . 

A control variable for the FADD 300, ADD, is calculated by applying the Boolean 
equivalence function to the sign bits si and so, so that when s\ and so are both false or are both 
true, the floating point numbers ft and ft have the same sign, an addition operation is being 
performed, and ADD is true, that is, is equal to a logic one. Similarly, when si and s 0 are 
different valued, a subtraction operation is being performed, and ADD is false, that is, is equal to 
a logic zero. The inverter 310 is provided to complement the ADD control variable to produce 
the SUB control variable which is input to the complement stage 312 and the round control logic 
326. 

On the left-hand side of FIG. 4, processing continues at the second right shifter 304, 
which receives the control variable ADD. If ADD = True, then the second right shifter 304 
shifts the mantissa mi to the right by one to produce a mantissa m 3 so that 
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m 3 = {m 3 [24],m 3 [23], m 3 [l], m 3 [0]} = {0, mi[24]=l, mi[23], ...,mi[2],mi[l]}. 
If ADD=False, then mi passes through the second right shifter 304 without any change so that 
m 3 is equal to mi. 

Processing of the mantissa m 3 continues downward with the introduction of two control 
variables, L 3 and G 3 to the round control logic 326. G 3 and L 3 respectively represent the least 
significant bit m 3 [0], and the second least significant bit m 3 [l] of the mantissa m 3 . Next, the 
guard bit G 3 of m 3 , that is, m 3 [0], is removed and a new guard bit G 5 generated from the round 

^ control logic 326 is added as the new least significant bit to produce a mantissa m 5 . The most 
significant bit of m 5 , m 5 [24], is complemented by the inverter 306 and returned as the new most 

© significant bit to form a mantissa m 7 that serves as an input to the adder stage 314. 

H Returning to the right-hand side of FIG. 4, the first right shifter 308 receives and shifts 

the bits of the 24 bit mantissa mo to the right according to the value of the following control 

!H variable, RSHIFT, where 

V4 

[= RSHIFT = d - eo +ADD. 

jj JSC! 

Accordingly, if RSHIFT is equal to zero, the mantissa m 0 is not shifted. The first right shifter 
308 produces a 25 bit mantissa m 2 as well as a series of control variables L 2 , G 2 , R 2? and S 2 . 
TABLE IX is a truth table for generation of L 2 , G 2 , R 2 , and S 2 within the first right shifter 308 
given the value of the control variable RSHIFT. For example, if RSHIFT is equal to 23, then L 2 
is equal to the most significant bit m 0 [23] of the mantissa m 0? that is, L 2 is True. The control 
20 variables G 2 and L 2 respectively represent the least significant bit m 2 [0], and the second least 
significant bit m 2 [l] of the mantissa m 2 . 
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Table IX 

Truth Table for Generation of L 2 , G 2 , R 2 and S 2 within Right Shifter 308 (FIG. 4) 
where | = OR; and m 0 [i] is the ith digit of m 0 



RSHIFT 


L 2 


G 2 


R 2 


s 2 


0 


m 0 [0] 


False 


False 


False 


1 


m 0 [l] 


m 0 [0] 


False 


False 


2 


m 0 [2] 


morn 


m 0 [0] 


False 


3 


ftioPl 

y J L J 


m 0 [2] 


m 0 [l] 


m 0 roi 


4 


m 0 [41 


mo [3] 

v L J 


mo [21 

v L J 


(morn m 0 roi) 


5 


mnrsi 


m 0 [4] 


m 0 [3] 

" L J 


(m 0 r21 m 0 rnim o [0]) 


6 


mor61 


moTSl 


mo[4] 


(m 0 [3] | : 


mo[0]) 


7 


m 0 [71 

AAi UL J 


mn[61 

U L J 


morsi 


(mo[4] 


i 


m 0 [0]) 


8 


m 0 r81 


m 0 [7] 


mo[61 

U 1_ J 


(mor51| m 0 r0]) 


9 


mor91 


m 0 r81 

u L J 


mo[71 


(m 0 r61 m 0 r01) 


10 


m 0 rioi 


m 0 r91 


m 0 r81 


(mom mo[0]) 


11 


±1± \J L J 


monoi 

J - A - L UL J 


moM 


(m 0 r81 morOl) 


12 


mori21 


m 0 rin 

\J L J 


mn r 101 


(m 0 r91 ImofO]) 


13 


mnri31 


mn[121 


mnrill 


(moflOl | 


mofO]) 


14 


moFHl 


mnri31 


m 0 [121 


(moRll m 0 [0]) 


15 


mnF151 


mo[141 

V L J 


mn[131 


(mori21| m 0 r0]) 


16 


mo[16] 


mo[15] 


m 0 [14] 


(m 0 ri3H m 0 [0]) 


17 


m 0 [17] 


mo[16] 


m 0 [15] 


(m 0 ri4H m 0 r01) 


18 


m 0 [18] 


m 0 [17] 


m 0 [16' 


(m 0 [15] 


mo[0]) 


19 


m 0 [19] 


m 0 [18] 


m 0 [171 


(m 0 ri6] | 


mo[0]) 


20 


m 0 [20] 


m 0 [19] 


m 0 [18] 


(mo[17] | 


m 0 [0]) 


21 


mo [21] 


m 0 [20] 


mo[ 19] 


(m 0 [18] | 


m 0 [0]) 


22 


mo [22] 


m 0 [21] 


mo[20] 


(mori91 1 


mo[0]) 


23 


True 


m 0 [221 


m 0 r211 


(mo [20] | 


m 0 roi) 


24 


False 


True 


mo[22 


(mo[21] 




m 0 roi) 


25 


False 


False 


True 


(m 0 [22] | 


m 0 rO]) 


< 26 


False 


False 


False 


True 



At the complement stage 312, if SUB is False, then ADD is True and an addition is being 
performed and none of the inputs to the complement stage 312 are complemented. If, however, 
SUB is True, each binary digit of the mantissa m 2 as well as each of the control variables L 2? G 2 , 
R 2 , and S 2 are complemented by the complement stage 312. The operation of the complement 
stage 312 is summarized as follows: 
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nL, = {m 4 [24], m4[23], m 4 [l], m4[0]} = {m 2 [24] A SUB, m 2 [23] A SUB, m 2 [2] A SUB, 
m 2 [l] A SUB}, 

U = U A SUB = m 2 [l] A SUB = m4[l], 
G 4 = G 2 A SUB = m 2 [0] A SUB = m^O], 
R4 = R 2 A SUB, and 

S4 = S 2 A SUB where A represents the Boolean exclusive-or operation. 
The 25 bit mantissa m 4 produced by the complement stage 312 serves as another input to the 
adder stage 314. The control variable outputs L 4 , G 4 , R 4 , and S 4 of the complement stage 312 are 
input to the round control logic 326. 

TABLE X is a truth table for generation of internal control variables G, L, and G5 within 
the round control logic 326 given the values of the input control variables L 3 , L 4 , G3, and G 4 and 
the internal control variable X. For example, if L 3 , L 4 , G3, and G 4 are all true, then G and L are 
False and True. As another example, if G3 or X is True, then G5 is True; otherwise G5 is False. 
As described above, G5 becomes the least significant bit of the 25 bit mantissa m^. The internal 
control variable X used to produce G5 is generated according to TABLE XI below. 



Table X 

Truth Table for Generation of G, L and G 5 within Round Control Logic 326 (FIG. 4); L = L 3 A L 4 
A (G 3 & G 4 ); G = G 3 A G 4; G 5 = G 3 1 X 

where A - EXCLUSIVE-OR; & = AND; and ] = OR 



L 3 


U 


G 3 


G 4 


G 


L 


X 


G 5 


False 


False 


False 


False 


False 


False 


False 


False 


False 


False 


False 


True 


True 


False 


False 


False 


False 


False 


True 


False 


True 


False 


False 


True 


False 


False 


True 


True 


False 


True 


False 


True 


False 


True 


False 


False 


False 


True 


False 


False 


False 


True 


False 


True 


True 


True 


False 


False 
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Table X 

Truth Table for Generation of G, L and G 5 within Round Control Logic 326 (FIG. 4); L = L 3 A L 4 
A (G 3 & G 4 ); G = G 3 A G 4; G 5 = G 3 1 X 

where A = EXCLUSIVE-OR; & = AND; and | = OR _____ 



u 


U 


G 3 


G 4 


G 


L 


X 


G 5 


False 


True 


True 


False 


True 


True 


False 


True 


False 


True 


True 


True 


False 


False 


False 


True 


True 


False 


False 


False 


False 


True 


False 


False 


True 


False 


False 


True 


True 


True 


False 


False 


True 


False 


True 


False 


True 


True 


False 


True 


True 


False 


True 


True 


False 


False 


False 


True 


True 


True 


False 


False 


False 


False 


False 


False 


True 


True 


False 


True 


True 


False 


False 


False 


True 


True 


True 


False 


True 


False 


False 


True 


True 


True 


True 


True 


False 


True 


False 


True 


False 


False 


False 


False 


False 


False 


True 


True 


False 


-i— > 1 
False 


False 


True 


True 


raise 


Irue 


x rue 


False 


False 


irue 


raise 


x rue 


Th q1 OA 


X XUv 


True 

A I Uv 




r ciio - 


True 


True 


False 


True 


True 

_L M. KM. 


True 


False 


True 


False 


False 


False 


True 


True 


True 


False 


True 


False 


True 


True 


True 


True 


True 


False 


True 


True 


False 


True 


True 


True 


True 


False 


True 


True 


True 


False 


False 


True 


True 


True 


False 


False 


False 


False 


True 


True 


True 


True 


False 


False 


True 


True 


True 


True 


True 


True 


False 


True 


False 


True 


True 


True 


True 


True 


False 


True 


True 


False 


False 


True 


True 


True 


True 


False 


False 


False 


False 


True 


True 


True 


True 


False 


True 


True 


False 


True 


True 


True 


True 


True 


False 


True 


False 


True 


True 


True 


True 


True 


True 


False 


True 


True 


True 



TABLE XI is a truth table for generation of the control variables GEN and PROP and the 
internal control variable X within the round control logic 326 given the values of the internal 
control variables L and G shown in TABLE X and of the input control variables SUB, R4, and 
S 4 . For example, if L, G, R4, S 4 , and SUB are all True, then GEN and PROP are both True and 
X is False. The control variables GEN and PROP are input to the carry logic 328. TABLE XI 
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also explains the generation of the signal Cin within the carry logic 328 given the values of the 
control variables GEN and PROP from the round control logic 326. Similarly, TABLE XII is a 
truth table for generation of the signal Cin within the carry logic 328 given the values of the 
control variables GEN and PROP and the signal Cout from the adder stage 314. Cout is the most 
significant carry bit of the adder stage 214. For example, if GEN is False and PROP is True, 
then Cin follows the value of Cout. If GEN and PROP are both False, then Cin is False. Finally, 
if GEN is True, then Cin is True. 



Table XI 


















Truth Table for Generation of X, GEN, PROP within Round Control Logic 326 and Cin within 


Carry Logic 328 (FIG. 4); 

X = L&~G&R4&S 4 & SUB; GEN = ~X & R4 & (G | S 4 | SUB) | G & S 4 & SUB; PROP = G 


& (L | S 4 | 
where & = 


SUB) | X 
= AND; | = 


OR; and ~ : 


-NOT 












L 


G 


R4 


s 4 


SUB 


X 


GEN 


PROP 


Cin 


False 


False 


False 


False 


False 


False 


False 


False 


False 


False 


False 


False 


False 


True 


False 


False 


False 


False 


False 


False 


False 


True 


False 


False 


False 


False 


False 


False 


False 


False 


True 


True 


False 


False 


False 


False 


False 


False 


True 


False 


False 


False 


False 


False 


False 


False 


False 


True 


False 


True 


False 


True 


False 


True 


False 


False 


True 


True 


False 


False 


True 


False 


True 


False 


False 


True 


True 


True 


False 


True 


False 


True 


False 


True 


False 


False 


False 


False 


False 


False 


False 


False 


True 


False 


False 


True 


False 


False 


True 


Cout 


False 


True 


False 


True 


False 


False 


False 


True 


Cout 


False 


True 


False 


True 


True 


False 


True 


True 


True 


False 


True 


True 


False 


False 


False 


True 


False 


True 


False 


True 


True 


False 


True 


False 


True 


True 


True 


False 


True 


True 


True 


False 


False 


True 


True 


True 


False 


True 


True 


True 


True 


False 


True 


True 


True 


True 


False 


False 


False 


False 


False 


False 


False 


False 


True 


False 


False 


False 


True 


False 


False 


False 


False 


True 


False 


False 


True 


False 


False 


False 


False 


False 


True 


False 


False 


True 


True 


False 


False 


False 


False 
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Table XI 

Truth Table for Generation of X, GEN, PROP within Round Control Logic 326 and Cin within 
Carry Logic 328 (FIG. 4); 

X = L&~G&R4&S 4 & SUB; GEN = ~X & R4 & (G | S 4 1 SUB) | G & S 4 & SUB; PROP = G 
& (L I S 4 I SUB) I X 

where & = AND; | = OR; and ~ = NOT 



L 


G 


R4 


s 4 


SUB 


X 


GEN 


PROP 


Cin 


True 


False 


True 


False 


False 


False 


False 


False 


False 


True 


False 


True 


False 


True 


False 


True 


False 


True 


True 


False 


True 


True 


False 


False 


True 


False 


True 


True 


False 


True 


True 


True 


True 


False 


True 


Cout 


True 


True 


False 


False 


False 


False 


False 


True 


Cout 


True 


True 


False 


False 


True 


False 


False 


True 


Cout 


True 


True 


False 


True 


False 


False 


False 


True 


Cout 


True 


True 


False 


True 


True 


False 


True 


True 


True 


True 


True 


True 


False 


False 


False 


True 


True 


True 


True 


True 


True 


False 


True 


False 


True 


True 


True 


True 


True 


True 


True 


False 


False 


True 


True 


True 


True 


True 


True 


True 


True 


False 


True 


True 


True 



Table XII 

Truth Table for Generation of Cin within Carry Logic 328 (FIG. 4); 
Cin = GEN | (PROP & Cout) 
where & = AND; | = OR 


GEN 


PROP 


Cout 


Cin 


False 


False 


False 


False 


False 


False 


True 


False 


False 


True 


False 


False 


False 


True 


True 


True 


True 


False 


False 


True 


True 


False 


True 


True 


True 


True 


False 


True 


True 


True 


True 


True 



The adder stage 314 receives the 25 bit mantissa m 7? the 25 bit mantissa 1114, and the Cin 
bit signal and adds these together to produce a 25 bit mantissa m 8 . The most significant carry bit 
Cout of the adder stage 314 is fed back to the carry logic 328 as described above. Thus, the input 
carry bit Cin to the adder stage 314 is a function of the carry bit Cout of the adder stage 314. 
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Next, the most significant bit of m 8 , m 8 [24], is complemented by the inverter 330 and returned as 
the new most significant bit to form a mantissa m 9 that serves as an input to the normalizer stage 
316. 

Processing continues to the normalizer stage 316, which receives the 25 bit mantissa mg. 
The count up to ei leading zeros stage 320 examines each bit of the mantissa m 9 beginning with 
the most significant bit m 9 [24] and counts logic zeros until the stage 320 counts ei leading zeros, 
until the stage 320 encounters a logic one, or until the 25 bits of the mantissa m 9 are exhausted, 
whichever comes first. The stage 320 outputs a control variable LSHIFT that is equal to the 
number of counted leading logic zeros in the mantissa m 9 The left shifter 318 receives and shifts 
the bits of the 25 bit mantissa m 9 to the left to produce a 25 bit mantissa mi 0 according to the 
value of the control variable, LSHIFT. Accordingly, if LSHIFT is equal to zero, the mantissa m 9 
is not shifted and the mantissa mio equals the mantissa m 9 . If LSHIFT is greater than zero, then 
LSHIFT logic zeros are shifted in to form the latter part of the mantissa mi 0 . 

Processing of the mantissa mio continues downward with the removal of Gio from the 
mantissa mio. Gio represents the least significant bit mio[0] of the mantissa mio. Next, the most 
significant bit of the resulting 24 bit mantissa is removed to produce the 23 bit mantissa m of the 
floating point number f reS uit output of the FADD 300. 

The subtract stage 322 subtracts the 8 bit control variable LSHIFT from the 8 bit 
exponent field value ei and produces an 8 bit output that is in turn sent to the increment adder 
stage 324. The increment adder stage 324 adds the 8 bit output of the subtract stage 322 to the 
one bit control variable ADD to yield an 8 bit output. The count up to ei leading zeros stage 320 
also outputs a control variable NOTZERO that is True if the mantissa m 9 is not equal to zero and 
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False if 1119 is equal to zero. The eight AND gates 332 each receive a respective bit of the 8 bit 
output of the increment adder stage 324 along with the control variable NOTZERO. If the 
mantissa m 9 is nonzero, then NOTZERO is True and the 8 bit output of the increment adder 
stage 324 is passed through the eight AND gates 332 to produce the 8 bit exponent field e of the 
floating point number f reS uit. If the mantissa m 9 is zero, then NOTZERO is False, the outputs of 
the eight AND gates 332 are zero, and the 8 bit exponent field e is zero. The sign bit si of the 
floating point number fi becomes the sign bit s of the floating point number f resu it. All parts s, e, 
and m of the floating point number f resu it are thus known and processing of the FADD 300 
terminates. 

Example A 

An example that demonstrates the operation of the FADDs 100, 200, 300 according to the 
embodiments is now described. The swapper stage 102 receives two floating point numbers, ii 
and io, and compares the numbers to determine their relative size. The number that is less than or 
equal to the other number is designated fo. The floating point number fo has the following 
values: 

Decimal value: 8388609 = 2 23 +l, 
so = 0, 

e 0 = 1 00 1 0 1 1 0- 150, 
and 

Bit 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m ffl = 0000000000000000000000 1. 
The floating point number fi has the following values: 

C:\TEMP\BEL-030 Application 07 30 01 FINAL.doc 



Believe Ref No. BEL-030 



CONFIDENTIAL AND PROPRIETARY TO BELIEVE, INC. 
-40- 

Atty.Dkt. 069102-0278148 



Decimal value: 8388610 = 2 ZJ +2, 
si=0, 

d = 1 0 0 1 0 1 10=150, 
and 

Bit 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m fl = 0000000000000000000001 0. 

The ADD and SUB control variables are calculated: 

ADD = (so = = Si) = (0 = = 0) = 1 , indicating an addition operation, and 
i SUB = ~ADD = ~(1) = 0. 

Example A (First Embodiment) 

The floating point numbers ft and f 0 , including mi = m fl and m 0 = m^, are processed by 
%f| the unpack stages 104, 106, respectively, so that on the left-hand side of FIG. 1, the sign bit si 
M and the eight bit exponent field e3 = ei are removed from ft, leaving a 25 bit adjusted mantissa 

H m 3? 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 3 = 0100000000000000000000010 

and on the right-hand side of FIG. 1, the sign bit s 0 and the eight bit exponent field e 2 = e 0 are 
removed from f 0 , leaving a 25 bit adjusted mantissa m 2 : 

20 Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 2 = 010000000000000000000000 L 

Processing of the mantissa m 3 continues downward with the introduction of two control 
variables, K 3 = m 3 [l] = 1 and L 3 = m 3 [0] = 0 to the adder 130. 
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Processing continues and the right shifter 108 receives and shifts the bits of the 25 bit 
mantissa m 2 to the right according to the value of the following control variable, RSHIFT 

RSHIFT = e 3 -e 2 = (1 001 01 1 0) -(1 001011 0) = 0000000 0. 
Since RSHIFT is equal to zero, the mantissa iri2 is not shifted. The right shifter 108 produces a 
25 bit mantissa m 4 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
1114= 0100000000000000000000001 

as well as a series of control variables K4, L4, G4, R4, and S4. By inspection of TABLE I, 
m K4 = m 2 [l] = m4[l] = 0, U = m 2 [0] = m4[0] = 1, G 4 = R4 = S 4 = 0. 

M) At the complement stage 112, SUB is False since ADD is True and an addition is being 

^ performed and none of the inputs to the complement stage 112 are complemented. The operation 

of the complement stage 112 is summarized as follows: 

h Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
5 W m 6 = 0100000000000000000000001 

j 3 K 6 = m 6 [ 1 ] - 0, L 6 = m 6 [0] = 1 , G 6 = Re = S 6 = 0. 

The adder 132 receives the control variable SUB and the input control variables G 6? Ra, 
and Se, and, according to TABLE II, generates: 
G 8 = R 8 = S 8 = 0, and Cin - 0. 

The adder 130 receives the values of Cin from the adder 132 and of the pairs of input 
20 control variables K 3 , L 3 and K<$ ? Le, and, according to TABLE III, generates: 
K 8 = L 8 = 1 . 
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The adder stage 114 receives the 25 bit mantissa m 3 , the 25 bit mantissa m 6; and the Cin 
bit signal and adds these together to produce a 25 bit mantissa m 7 . The carry out bit of the adder 
stage 114 is discarded and is not used to obtain the final result of the FADD 100. 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
Carry 1 

m 3 = 0100000000000000000000010 
m 6 = 0100000000000000000000001 
Cin 0 
m 7 = 100000000000000000000001 1 

Processing continues to the normalizer stage 116, which receives the 25 bit mantissa m 7 
and the control variable G 8 from the adder 132. The control variable LSHIFT is equal to the 
number of counted leading logic zeros in the mantissa m 7 and is therefore zero. Since LSHIFT is 
equal to zero, the mantissa m 7 is not shifted and the mantissa mn equals the mantissa m 7 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
mn = 100000000000000000000001 1. 
The control variables K 8 and L 8 join with G 8? R 8 , and S 8 to form a 5 bit word that is input 

to the left shifter 134: 

Kg L 8 G 8 R 8 S 8 = 1 1 0 0 0. 

The shifter 134 receives the 5 bit word consisting of the control variables K 8 , Lg, G 8 , R 8 , 
and S 8 and shifts the bits of the 5 bit word to the left to produce a 5 bit word that consists of the 
control variables Kio, Lio, Gio, Rio 5 and Si 0 , according to the value of the control variable, 
LSHIFT. Since LSHIFT - 0, there is no shift and, according to TABLE IV, the 5 bit word is 
equal to 

Kio Lio Gio Rio Sio = Kg L 8 G 8 R 8 S 8 = 1 1 0 0 0. 
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Next, the round control logic 126 receives the control variables Kjo, L ]05 Gio, Rio, and 
Sio, and, according to TABLE V, generates: 
INC=1. 

The signal INC serves as a carry input to the rounding stage 128. 

Processing of the mantissa mn continues downward with the removal of the least 
significant bit mn[0] from the mantissa mn to produce a 24 bit mantissa mi 2 

Bit 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 12 = 10000000000000000000000 1. 
The rounding stage 128 adds the signal INC to the mantissa m i2 to produce a 24 bit 

mantissa mn 

Bit 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
mi3= 10000000000000000000001 0. 
Next, the most significant bit of the 24 bit mantissa m^ is removed to produce the 23 bit 

mantissa m 

Bit 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m= 00000000000000000000010 
of the floating point number f resu it output of the FADD 100. The most significant bit msb=l 
serves as an input to the increment adder stage 124. 

The subtract stage 122 subtracts the 8 bit control variable LSHIFT 00000000 from the 8 
bit exponent field value e3 100101 10 from the unpack stage 104 and produces an 8 bit output 
100101 10 that is in turn sent to the increment adder stage 124. The increment adder stage 124 
adds the 8 bit output 100101 10 of the subtract stage 122 to the one bit most significant bit msb=l 
to produce the 8 bit exponent field e 1 00101 1 1 of the floating point number f resu it- The sign bit Si 
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= 0 of the floating point number ft becomes the sign bit s of the floating point number f resu it. All 
parts s, e, and m of the floating point number f resu it are thus known and processing of the FADD 
100 terminates. 

Example A (Second Embodiment) 

The floating point numbers ft and fo, including mi = m^ and mo = m^, are processed by 
the unpack stages 104, 106, respectively, so that on the left-hand side of FIG. 2, the sign bit Si 
and the eight bit exponent field e3 = ei are removed from ft, leaving a 25 bit adjusted mantissa 

S© Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
i2 m 3 = 0100000000000000000000010 

J y and on the right-hand side of FIG. 2, the sign bit s 0 and the eight bit exponent field e 2 = e 0 are 
removed from f 0 , leaving a 25 bit adjusted mantissa m 2 : 

5 5 H 

U Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
H m 2 = 010000000000000000000000 1. 

Processing of the mantissa m 3 continues downward with the introduction of two control 
variables, J 3 = m 3 [24] = 0, K 3 = m 3 [l] = 1, and L 3 = m 3 [0] = 0 to the round control logic 226. 

Processing continues and the right shifter 208 receives and shifts the bits of the 25 bit 
mantissa m 2 to the right according to the value of the following control variable, RSHIFT 
20 RSHIFT = e 3 -e 2 = (1 0 0 1 0 1 1 0) - (1 0 0 1 0 1 1 0) = 0 0 0 0 0 0 0 0. 

Since RSHIFT is equal to zero, the mantissa m 2 is not shifted. The right shifter 208 produces a 
25 bit mantissa rm 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
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m 4 = 010000000000000000000000 
as well as a series of control variables J 4? K4, L 4 , G 4 , R4 ? and S 4 . By inspection of TABLE I, 

J 4 - m 2 [24] = m424] = 0, K4 = m 2 [l] = m^l] = 0, L 4 = m 2 [0] = imtO] = 1, and 

G 4 - R4 = S 4 = 0. 

At the complement stage 212, SUB is False since ADD is True and an addition is being 
performed and none of the inputs to the complement stage 212 are complemented. The operation 
of the complement stage 212 is summarized as follows: 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 
m 6 = 010000000000000000000000 

J 6 - m 6 [24] - 0 5 K6 = m 6 [l] = 0, I* = m 6 [0] = 1, G 6 = R 6 = S 6 - 0. 
The round control logic 226 receives the control variable SUB and the input control 
variables J 3 , K 3 > L 3 , J 6 , L 6 , G 6 , R 6 , and S 6? and, according to TABLES VI and VII, generates: 
G 8 = Don't Care, 
GEN = True = 1, and 
PROP - True = 1. 

The control variables GEN and PROP and the signal C23 from the adder stage 214 are 
input to the carry logic 228, which, according to TABLES VII and VIII, generates 
Cin = True = 1 . 

In this example, Cin does not depend on the value of C23 from the adder stage 214. 

The adder stage 214 receives the 25 bit mantissa m 3 , the 25 bit mantissa m& and the Cin 
bit signal and adds these together to produce a 25 bit mantissa m 7 . The second most significant 
carry bit C23 of the adder stage 214 is fed back to the carry logic 228. 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 
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Carry 10 11 
m 3 = 0100000000000000000000010 
m 6 = 0100000000000000000000001 

Cin 1 
mi = 1000000000000000000000100 
C23 = CARRY BIT(m 3 [23]+m 6 [23]+C22) = CARRY BIT(1 + 1 + 0) = 1. 

Processing continues to the normalizer stage 216, which receives the 25 bit mantissa m-j 
and the control variable Gg from the round control logic 226. The control variable LSHIFT is 
equal to the number of counted leading logic zeros in the mantissa m 7 and is therefore zero. 
Since LSHIFT is equal to zero, the mantissa m 7 is not shifted and the mantissa mio equals the 
mantissa m 7 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
I" m,<f= 100000000000000000000010 0. 
. j Processing of the mantissa mio continues downward with the removal of the least 

|4 significant bit mjo[0] from the mantissa mio. Next, the most significant bit of the resulting 24 bit 

B mantissa is removed to produce the 23 bit mantissa m 

Bit 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m= 00000000000000000000010 

of the floating point number f resu it output of the FADD 200. The most significant bit msb=l 
20 serves as an input to the increment adder stage 224. 

The subtract stage 222 subtracts the 8 bit control variable LSHIFT 00000000 from the 8 
bit exponent field value e 3 100101 10 from the unpack stage 104 and produces an 8 bit output 
100101 10 that is in turn sent to the increment adder stage 224. The increment adder stage 224 
adds the 8 bit output 100101 10 of the subtract stage 222 to the one bit most significant bit msb=l 
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to produce the 8 bit exponent field e 100101 1 1 of the floating point number f resu it. The sign bit s\ 
= 0 of the floating point number fj becomes the sign bit s of the floating point number f reS uit. All 
parts s, e, and m of the floating point number f reS uit are thus known and processing of the FADD 
200 terminates. 

Example A (Third Embodiment) 

On the left-hand side of FIG. 4, the sign bit si and the eight bit exponent field ej are 
> n removed from the floating point number f u leaving the 23 bit mantissa mn. Processing of the 
,S mantissa mn continues downward with the introduction of a guard bit Gi having a value of logic 
Sj) zero as the new least significant bit and an implicit one bit as the new most significant bit, 
M producing a 25 bit mantissa mi 

„ 

s Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 

Q m = 1000000000000000000000100. 

u and on the right-hand side of FIG. 4, the sign bit s 0 and the eight bit exponent field e 0 are 

f« removed from the floating point number f 0? leaving the 23 bit mantissa mfo. Processing of the 

mantissa mfo continues downward with the introduction of an implicit one bit as the new most 

significant bit, producing a 24 bit mantissa mo 

Bit 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 0 = 10000000000000000000000 L 

20 On the left-hand side of FIG. 4, processing continues at the second right shifter 304, 

which receives the control variable ADD. ADD is True, so the second right shifter 304 shifts the 
mantissa mi to the right by one to produce a mantissa m 3 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
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m 3 = 010000000000000000000001 0. 

Processing of the mantissa 1113 continues downward with the introduction of two control 
variables, L3 = m3[l] = 1, and G 3 = m3[0] = 0 to the round control logic 326. 

Processing continues and the first right shifter 308 receives and shifts the bits of the 25 
bit mantissa m 0 to the right according to the value of the following control variable, RSHIFT 

RSHIFT = e 3 -e 2 + ADD- (1 0 0 1 0 1 1 0) - (1 0 0 1 0 1 1 0) + (1) = 0 0 0 0 0 0 0 1. 
Since RSHIFT is equal to one, each bit of the mantissa m 0 is shifted to the right by one. The first 
right shifter 308 produces a 25 bit mantissa m 2 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 2 = 0100000000000000000000001 

as well as a series of control variables L2, G2, R2, and S2. By inspection of TABLE IX, 
L 2 = m 0 [l] = m 2 [l] = 0, G 2 = m 0 [0] = m 2 [0] = 1, R 2 = S 2 = 0. 

At the complement stage 312, SUB is False since ADD is True and an addition is being 
performed and none of the inputs to the complement stage 312 are complemented. The operation 
of the complement stage 312 is summarized as follows: 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m4= 0100000000000000000000001 
L 4 = nuti] = 0, G 4 = m4[0] = 1, R4 = S 4 = 0. 

The round control logic 326 receives the control variable SUB and the input control 
variables L3, G 3 , L 4 , G 4 , R 4> and S 4 , and, according to TABLES X and XI, generates: 
G 5 = False = 0, 
GEN = False = 0, and 
PROP = True - 1. 
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The control variables GEN and PROP and the signal Cout from the adder stage 314 are 
input to the carry logic 328, which, according to TABLES XI and XII, generates 
Cin = Cout. 

In this example, Cin follows the value of Cout from the adder stage 314. 

Next, the guard bit G3 of m 3 , that is, m 3 [0] = 0, is removed and a new guard bit G 5 = 0 
generated from the round control logic 326 is added as the new least significant bit to produce a 
mantissa m$ 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 5 = 010000000000000000000001 0. 

The most significant bit of m 5 , m 5 [24], is complemented by the inverter 306 and returned 
as the new most significant bit to form a mantissa m 7 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 7 = 1 10000000000000000000001 0. 

that serves as an input to the adder stage 314. 

The adder stage 314 receives the 25 bit mantissa m 7 , the 25 bit mantissa m4, and the Cin 
bit signal and adds these together to produce a 25 bit mantissa m 8 . The most significant carry bit 
Cout of the adder stage 314 is fed back to the carry logic 328. In order to calculate Cout from 
the adder stage 314, since Cin follows the value of Cout, an initial value of logic zero is assumed 
for Cin. If Cout is found to be equal to a logic one when the sum is performed, then Cin is 
adjusted to a logic one and the sum is performed again. The following illustrates the final sum, 
once Cin is found to be a logic one. 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
Carry 1 1 1 
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m 7 = 1 100000000000000000000010 
m 4 = 0100000000000000000000001 

Cin 1 
m 8 = 0000000000000000000000100 
Cout = CARRY BIT(m 7 [24]+m 4 [24]+C23) = CARRY BIT(1 + 0 + 1) = C24 = 1. 
Next, the most significant bit of m 8? m 8 [24], is complemented by the inverter 330 and 

returned as the new most significant bit to form a mantissa m 9 that serves as an input to the 

normalizer stage 316 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 9 = 100000000000000000000010 0. 

Processing continues to the normalizer stage 316, which receives the 25 bit mantissa m 9 . 
The control variable LSHIFT is equal to the number of counted leading logic zeros in the 
mantissa m 9 and is therefore zero. Since LSHIFT is equal to zero, the mantissa m 9 is not shifted 
and the mantissa mio equals the mantissa m 9 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 10 = 1 00000000000000000000010 0. 

Processing of the mantissa mio continues downward with the removal of the least 
significant bit Gj 0 from the mantissa mi 0 . Next, the most significant bit of the resulting 24 bit 
mantissa is removed to produce the 23 bit mantissa m 

Bit 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m= 00000000000000000000010 

of the floating point number f resu it output of the FADD 300. 

The subtract stage 322 subtracts the 8 bit control variable LSHIFT 00000000 from the 8 
bit exponent field value ei 100101 10 and produces an 8 bit output 100101 10 that is in turn sent 
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to the increment adder stage 324. The increment adder stage 324 adds the 8 bit output 100101 10 
of the subtract stage 322 to the one bit control variable ADD=1 to produce the 8 bit exponent 
field e 100101 1 1 of the floating point number f resu it. The sign bit si = 0 of the floating point 
number f\ becomes the sign bit s of the floating point number f resu it. All parts s, e, and m of the 
floating point number f resu it are thus known and processing of the FADD 300 terminates. 

Example B 

Another example that demonstrates the operation of the FADDs 100, 200, 300 according 
to the embodiments is now described. The swapper stage 102 receives two floating point 
numbers, ii and io, and compares the numbers to determine their relative size. The number that is 
less than or equal to the other number is designated fo. The floating point number fo has the 
following values: 

Decimal value: -8388605.5 = -(2 23 )+2.5 ? 

So= 1, 

e 0 = l 00 1 0 1 0 1 = 149, 
and 

Bit 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m*r= 111111111111111111110 11. 

The floating point number fi has the following values: 

Decimal value: 16777215 = 2 24 -l ? 

si=0 5 

d = 1 00 1 0 1 1 0 = 150, 
and 
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Bit 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m fl = 11111111111111111111111. 
The ADD and SUB control variables are calculated: 

ADD = (so = = si) = (1 = = 0) = 0, indicating a subtraction operation, and 
SUB = ~ADD = ~(0)=1. 

Example B (First Embodiment) 

The floating point numbers ft and f 0 , including mi = mn and m 0 = m©, are processed by 
uj the unpack stages 104, 106, respectively, so that on the left-hand side of FIG. 1, the sign bit S\ 
It) and the eight bit exponent field e3 = ei are removed from ft, leaving a 25 bit adjusted mantissa 

m Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
j3 m 3 = 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
W and on the right-hand side of FIG. 1, the sign bit so and the eight bit exponent field e2 = eo are 

I™ removed from fo, leaving a 25 bit adjusted mantissa m2: 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 2 = 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 1 1. 
Processing of the mantissa 1113 continues downward with the introduction of two control 

variables, K3 = m3[l] = 1 and L 3 = m3[0] = 1 to the adder 130. 

20 Processing continues and the right shifter 108 receives and shifts the bits of the 25 bit 

mantissa m 2 to the right according to the value of the following control variable, RSHIFT 

RSHIFT = e3 -62 = (1 001011 0) -(1 001010 1) = 0000000 1. 
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Since RSHIFT is equal to one, the right shifter 108 shifts the bits of the mantissa m 2 to the right 
by one, producing a 25 bit mantissa nu 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m4= 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 1 

as well as a series of control variables K4, L 4 , G 4 , R4, and S 4 . By inspection of TABLE I, with 
RSHIFT equal to one, 

K4 = m 2 [2] = ni4[l] = 0, L 4 = m 2 [l] = m^O] = 1, G 4 = m 2 [0] = 1, R4 = S 4 = 0. 

At the complement stage 112, SUB is True since ADD is False and a subtraction is being 

iQ performed and so the inputs to the complement stage 112 are complemented. The operation of 
W the complement stage 112 is summarized as follows: 

N Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
W m 6 = 1100000000000000000000010 
O Ko = m 6 [l] = 1, L 6 = m 6 [0] = 0, G 6 = 0, R 6 = S 6 = 1. 

M The adder 132 receives the control variable SUB and the input control variables Ra, 

and S6 ? and, according to TABLE II, generates: 
G 8 = 1, R 8 = S 8 = 0, and Cin = 0. 

The adder 130 receives the values of Cin from the adder 132 and of the pairs of input 
control variables K 3 , L 3 and Ke 9 L 6 , and, according to TABLE III, generates: 
K 8 = 0, L 8 = l. 

20 The adder stage 114 receives the 25 bit mantissa m 3 , the 25 bit mantissa m 6 , and the Cin 

bit signal and adds these together to produce a 25 bit mantissa m 7 . The carry out bit of the adder 
stage 114 is discarded and is not used to obtain the final result of the FADD 100. 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
C:\TEMP\BEL-030 Application 07 30 01 FINAL.doc 



Believe Ref No. BEL-030 



CONFIDENTIAL AND PROPRIETARY TO BELIEVE, INC. 
-54- 

Attv. Dkt. 069102-0278148 



Carry 11111111111111111111111 

m 3 = 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

m<r 1100000000000000000000010 
Cin 0 

m 7 = 0100000000000000000000001 

Processing continues to the normalizer stage 116, which receives the 25 bit mantissa my 
and the control variable Gg from the adder 132. The control variable LSHIFT is equal to the 
number of counted leading logic zeros in the mantissa m7 and is therefore one. Since LSHIFT is 
equal to one, the left shifter 118 shifts the bits of the mantissa m 7 to the left by one and brings in 
the control variable Gg = 1, producing the mantissa mn 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
mn= 100000000000000000000001 1. 

The control variables K 8 and L 8 join with Gg, Rg, and Sg to form a 5 bit word that is input 
to the left shifter 134: 

K 8 Lg G 8 R 8 S 8 = 01 100. 

The shifter 134 receives the 5 bit word consisting of the control variables Kg, L 8 , G 8 , Rg, 
and Sg and shifts the bits of the 5 bit word to the left to produce a 5 bit word that consists of the 
control variables K10, L10, Gio, Rio, and S10, according to the value of the control variable, 
LSHIFT, which in this case is equal to one. According to TABLE IV, the 5 bit word is equal to 

K10 L 10 G 10 Rio S10 = L 8 G 8 R 8 Sg False = 1 1 0 0 0. 

Next, the round control logic 126 receives the control variables K10, L10, Gio, Rio? and 
S10, and, according to TABLE V, generates: 
INC=1. 
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The signal INC serves as a carry input to the rounding stage 128. 

Processing of the mantissa mu continues downward with the removal of the least 
significant bit mn[0] from the mantissa mu to produce a 24 bit mantissa m i2 

Bit 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m l2 = 10000000000000000000000 1. 
The rounding stage 128 adds the signal INC to the mantissa mi 2 to produce a 24 bit 

mantissa m^ 

Bit 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
mi 3 = 10000000000000000000001 0. 
Next, the most significant bit of the 24 bit mantissa mi 3 is removed to produce the 23 bit 

mantissa m 

Bit 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m= 00000000000000000000010 
of the floating point number f resu it output of the FADD 100. The most significant bit msb=l 

serves as an input to the increment adder stage 124. 

The subtract stage 122 subtracts the 8 bit control variable LSHIFT 00000001 from the 8 

bit exponent field value e3 100101 10 from the unpack stage 104 and produces an 8 bit output 

10010101 that is in turn sent to the increment adder stage 124. The increment adder stage 124 

adds the 8 bit output 10010101 of the subtract stage 122 to the one bit most significant bit msb=l 

to produce the 8 bit exponent field e 1 001 01 1 0 of the floating point number f resu it. The sign bit S\ 

= 0 of the floating point number fj becomes the sign bit s of the floating point number f reS uit* All 

parts s 5 e, and m of the floating point number f resu it are thus known and processing of the FADD 

100 terminates. 
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Example B (Second Embodiment) 

The floating point numbers ft and fo, including mi = mn and mo = mfo, are processed by 
the unpack stages 104, 106, respectively, so that on the left-hand side of FIG. 2, the sign bit s\ 
and the eight bit exponent field e3 = ei are removed from ft, leaving a 25 bit adjusted mantissa 

m 3 , 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 3 = 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
and on the right-hand side of FIG. 2, the sign bit so and the eight bit exponent field e2 = eo are 

removed from fo, leaving a 25 bit adjusted mantissa nv?: 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 2 = 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 1 1. 

Processing of the mantissa m 3 continues downward with the introduction of two control 
variables, J 3 = m 3 [24] = 0, K 3 = m 3 [l] = 1, and L 3 = m 3 [0] = 1 to the round control logic 226. 

Processing continues and the right shifter 208 receives and shifts the bits of the 25 bit 
mantissa ni2 to the right according to the value of the following control variable, RSHIFT 

RSHIFT = 63-62 = (1 001011 0) -(1 001010 1) = 0000000 1. 
Since RSHIFT is equal to one, the right shifter 208 shifts the bits of the mantissa m2 to the right 
by one, producing a 25 bit mantissa ni4 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
1114= 0 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 1 
as well as a series of control variables J4, K4, L 4 , G4, R4, and S4. By inspection of TABLE I, with 

RSHIFT equal to one, 
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J 4 = ni4[24] = 0, K4 = m 2 [2] = nu[l] = 0, L 4 = m 2 [l] = ni4[0] = 1, G 4 = m 2 [0] = 1 and 
R4 = S 4 = 0. 

At the complement stage 212, SUB is True since ADD is False and a subtraction is being 
performed and so the inputs to the complement stage 212 are complemented. The operation of 
the complement stage 212 is summarized as follows: 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 6 =l 100000000000000000000010 

J 5 = m6[24] = 1, K6 = m 6 [l] = 1, L 6 = m 6 [0] - 0, G 6 = 0, R 6 - S 6 = 1. 
»Q The round control logic 226 receives the control variable SUB and the input control 

f ft variables J 3 , K 3 , L 3 , J 6 , Ke, L 6 , G 6? and S 6 , and, according to TABLES VI and VII, generates: 
^ G 8 = True, 

m GEN = False -0, and 

q PROP - True =1. 

14 The control variables GEN and PROP and the signal C23 from the adder stage 214 are 

M input to the carry logic 228, which, according to TABLES VII and VIII, generates 
Cin = C23. 

In this example, Cin follows the value of C23 from the adder stage 214. 

The adder stage 214 receives the 25 bit mantissa m 3 , the 25 bit mantissa m 6 , and the Cin 
bit signal and adds these together to produce a 25 bit mantissa m 7 . The second most significant 
20 carry bit C23 of the adder stage 214 is fed back to the carry logic 228. In order to calculate C23 
from the adder stage 214, since Cin follows the value of C23, an initial value of logic zero is 
assumed for Cin. If C23 is found to be equal to a logic one when the sum is performed, then Cin 
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is adjusted to a logic one and the sum is performed again. The following illustrates the final 
sum, once Cin is found to be a logic one. 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
Carry 111111111111111111111111 
m 3 = 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
m 6 = 1 100000000000000000000010 
Cin 1 
m 7 = 0100000000000000000000010 
_ C23 = CARRY BIT(m 3 [23]+m 6 [23]+C22) = CARRY BIT(1 + 1 + 1) = 1 . 

ll) Processing continues to the normalizer stage 216, which receives the 25 bit mantissa m 7 

"2 and the control variable Gg from the round control logic 226. The control variable LSHIFT is 
li equal to the number of counted leading logic zeros in the mantissa mj and is therefore one. 

Since LSHIFT is equal to one, the left shifter 218 shifts the bits of the mantissa m-? to the left by 

:": "'•) 

H one and brings in the control variable Gg = 1 , producing the mantissa mio 

M Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
fj m 10 = 100000000000000000000010 1. 
Processing of the mantissa mio continues downward with the removal of the least 

significant bit mio[0] from the mantissa mi 0 . Next, the most significant bit of the resulting 24 bit 

mantissa is removed to produce the 23 bit mantissa m 

20 Bit 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m= 00000000000000000000010 
of the floating point number f resu it output of the FADD 200. The most significant bit msb=l 

serves as an input to the increment adder stage 224. 
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The subtract stage 222 subtracts the 8 bit control variable LSHIFT 00000001 from the 8 
bit exponent field value e 3 100101 10 from the unpack stage 104 and produces an 8 bit output 
10010101 that is in turn sent to the increment adder stage 224. The increment adder stage 224 
adds the 8 bit output 10010101 of the subtract stage 222 to the one bit most significant bit msb=l 
to produce the 8 bit exponent field e 100101 10 of the floating point number f resu it. The sign bit si 
= 0 of the floating point number ft becomes the sign bit s of the floating point number f resu it. All 
parts s, e, and m of the floating point number f resu it are thus known and processing of the FADD 
^ 200 terminates. 

|p Example B (Third Embodiment) 

H On the left-hand side of FIG. 4, the sign bit si and the eight bit exponent field ej are 

- removed from the floating point number f u leaving the 23 bit mantissa m fl . Processing of the 
'ji mantissa mfi continues downward with the introduction of a guard bit Gi having a value of logic 
\Z zero as the new least significant bit and an implicit one bit as the new most significant bit, 
producing a 25 bit mantissa mi 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
mi = 1111111111111111111111110. 

and on the right-hand side of FIG. 4 9 the sign bit s 0 and the eight bit exponent field e 0 are 
removed from the floating point number f 0j leaving the 23 bit mantissa mio. Processing of the 
20 mantissa m^ continues downward with the introduction of an implicit one bit as the new most 
significant bit, producing a 24 bit mantissa m 0 

Bit 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 0 = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 1 L 
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On the left-hand side of FIG. 4, processing continues at the second right shifter 304, 
which receives the control variable ADD. ADD is False, so the second right shifter 304 does not 
shift the mantissa mi and produces a mantissa m 3 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 3 = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 
that is equal to the mantissa mi. 

Processing of the mantissa 1113 continues downward with the introduction of two control 

variables, L 3 = m 3 [l] = 1, and G 3 = m 3 [0] = 0 to the round control logic 326. 

;:Q Processing continues and the first right shifter 308 receives and shifts the bits of the 25 

I s ® bit mantissa m 0 to the right according to the value of the following control variable, RSHIFT 

;M RSHIFT = e 3 -e 2 + ADD=(l 001011 0) -(1 001010 1) + (0) = 0 0 0 0 0 0 0 1. 

- Since RSHIFT is equal to one, each bit of the mantissa m 0 is shifted to the right by one. The first 

;ri right shifter 308 produces a 25 bit mantissa m 2 

S Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
]*: m 2 = 0 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 0 1 1 

as well as a series of control variables L 2 , G 2 , R 2 , and S 2 . By inspection of TABLE IX 5 
L 2 - m 0 [l] = m 2 [l] = 1, G 2 = m 0 [0] - m 2 [0] = 1, R 2 = S 2 = 0. 

At the complement stage 312, SUB is True since ADD is False and a subtraction is being 
performed and so the inputs to the complement stage 312 are complemented. The operation of 
20 the complement stage 312 is summarized as follows: 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
1114= 1000000000000000000000100 

U = m4[l] = 0, G 4 = m4[0] = 0, R4 = S 4 = 1. 
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The round control logic 326 receives the control variable SUB and the input control 
variables L3, G3, L4, G4, R4, and S4, and, according to TABLES X and XI, generates: 
G 5 = True= 1, 
GEN = False = 0, and 
PROP = True = 1. 

The control variables GEN and PROP and the signal Cout from the adder stage 314 are 
input to the carry logic 328, which, according to TABLES XI and XII, generates 
Cin = Cout. 

E J 
- f% 

,S In this example, Cin follows the value of Cout from the adder stage 314. 
If® Next, the guard bit G 3 of m 3; that is, m 3 [0] = 0, is removed and a new guard bit G 5 = 1 

H generated from the round control logic 326 is added as the new least significant bit to produce a 
E mantissa m 5 

l -.3 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 

£ » t 

H m 5 = 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1. 
H The most significant bit of m 5 , m 5 [24], is complemented by the inverter 306 and returned 

as the new most significant bit to form a mantissa m 7 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m 7 = 0111111111111111111111111 

that serves as an input to the adder stage 314. 
20 The adder stage 314 receives the 25 bit mantissa m 7 , the 25 bit mantissa 1114, and the Cin 

bit signal and adds these together to produce a 25 bit mantissa m 8 . The most significant carry bit 
Cout of the adder stage 314 is fed back to the carry logic 328. In order to calculate Cout from 
the adder stage 314, since Cin follows the value of Cout, an initial value of logic zero is assumed 
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for Cin. If Cout is found to be equal to a logic one when the sum is performed, then Cin is 
adjusted to a logic one and the sum is performed again. The following illustrates the final sum, 
once Cin is found to be a logic one. 

Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 
Carry 11111111111111111 
m 7 = 01111111111111111 
m 4 = 10000000000000000 
Cin 

□ m 8 = 00000000000000000 
(H Cout = CARRY BIT(m 7 [24]+m4[24]+C23) = CARRY BIT(0 + 1 + 1) = C 

l Z Next, the most significant bit of m 8 , m 8 [24], is complemented by the inverter 330 and 

1 2 returned as the new most significant bit to form a mantissa m9 that serves as an input to the 
normalizer stage 316 

"4 Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
U m 9 - 100000000000000000000010 0. 
3.T Processing continues to the normalizer stage 316, which receives the 25 bit mantissa m9. 

The control variable LSHIFT is equal to the number of counted leading logic zeros in the 
mantissa m 9 and is therefore zero. Since LSHIFT is equal to zero, the mantissa m 9 is not shifted 
and the mantissa mi 0 equals the mantissa m 9 
20 Bit 24 23 22 21 20 19 18 17 16 15 14 13 12 II 10 9 8 7 6 5 4 3 2 1 0 
m 10 = 100000000000000000000010 0. 

Processing of the mantissa mio continues downward with the removal of the least 
significant bit do from the mantissa m i0 . Next, the most significant bit of the resulting 24 bit 
mantissa is removed to produce the 23 bit mantissa m 



7 6 5 4 3 2 1 0 
1111111 
11111111 
0 0 0 0 0 1 0 0 

1 

0 0 0 0 0 1 0 0 
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Bit 22 21 20 19 18 17 16 15 14 13 12 11 10 9 8 7 6 5 4 3 2 1 0 
m= 00000000000000000000010 
of the floating point number f resu it output of the FADD 300. 

The subtract stage 322 subtracts the 8 bit control variable LSHIFT 00000000 from the 8 
bit exponent field value ei 100101 10 and produces an 8 bit output 100101 10 that is in turn sent 
to the increment adder stage 324. The increment adder stage 324 adds the 8 bit output 100101 10 
of the subtract stage 322 to the one bit control variable ADD=0 to produce the 8 bit exponent 
field e 100101 10 of the floating point number f resu it. The sign bit si = 0 of the floating point 
number fi becomes the sign bit s of the floating point number f resu it. All parts s, e, and m of the 
floating point number f resu i t are thus known and processing of the FADD 300 terminates. 

Of course, it should be understood that the floating point adder 100, 200, 300 system 
configurations, control logic implementations, and connections shown in FIG. 1-4 are merely 
intended to be exemplary, and that other configurations, implementations, and connections are 
possible and may be used as suitable. For example, although the FADDs 100, 200, 300 of FIGS. 
1, 2 and 4 are designed for handling single precision (32-bit) floating point numbers in 
compliance with the IEEE Std 754 round to nearest methodology, the FADDs 100, 200, 300 may 
be extended to process any size of floating point numbers as suitable, including, for example, 
double precision (64-bit) floating point numbers. 

The 32 bits of a single precision floating point number include one bit for a sign bit, eight 
bits for an exponent field, and 23 bits for the mantissa. For norm numbers, a bit value of one is 
understood to precede the 23 bits of the mantissa, becoming in effect, an implicit one most 
significant bit. A norm number has an exponent field that takes on a value between zero and 
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255. A denorm number, by contrast, has an implicit zero most significant bit of the mantissa, a 
mantissa that is not equal to zero, and an exponent field that is equivalent to zero. 

One notable difference between the FADDs 100, 200, 300 is that the FADD 300 is not 
designed to handle denorm numbers, while the FADDs 100 9 200 are equipped to handle denorm 
numbers. Of course, depending on the demands of a particular application, the capability of 
handling denorm numbers may or may not be necessary. It should be understood that, where 
necessary or as desired, all embodiments may be easily modified to handle denorm numbers as in 
FIGS. 1-3 or otherwise modified to handle only norm numbers as in FIG. 4. 

Of course, it should be understood that although inverters and complement stages are 
illustrated at various points in the FADDs 100, 200, 300, a given signal value and the 
complement of the given signal value are available at all places as suitable, even though inverters 
and complement stages are illustrated for clarity. For example, the functions performed by the 
complement stage 312 of FIG. 4 and the inverter 310 could be incorporated into the round 
control logic 326. Similarly, although certain control variable values overlap with mantissa 
values, in some instances, such values are illustrated separately for clarity. For example, in FIG. 
2, the control variable outputs J4, K4, and L4 of the right shifter 208 are actually equivalent by 
definition to values within the mantissa 1114 also output by the right shifter 208, these are drawn 
separately for clarity. 

The present invention can be implemented in digital electronic circuitry, or in computer 
hardware, firmware, software, or in combinations of them. Apparatus of the invention can be 
implemented in a computer program product tangibly embodied in a machine-readable storage 
device for execution by a programmable processor; and method acts of the invention can be 
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performed by a programmable processor executing a program of instructions to perform 
functions of the invention by operating on input data and generating output. The invention can 
be implemented advantageously in one or more computer programs that executable on a 
programmable system including at least one programmable processor coupled to receive data and 
instructions from, and to transmit data and instructions to, a data storage system, at least one 
input device, and at least one output device. Each computer program can be implemented in a 
high-level procedural or object-oriented programming language, or in assembly or machine 
p=, language if desired; and in any case, the language can be a compiled or interpreted language. 
,g Suitable processors include, by way of example, both general and special purpose 
W microprocessors. Generally, a processor will receive instructions and data from a read-only 
H memory and/or a random access memory. Generally, a computer will include one or more mass 

S - 

^ storage devices for storing data files; such devices include magnetic disks, such as internal hard 
disks and removable disks; magneto-optical disks; and optical disks. Storage devices suitable for 

U tangibly embodying computer program instructions and data include all forms of non- volatile 
memory, including by way of example semiconductor memory devices, such as EPROM, 
EEPROM, and flash memory devices; magnetic disks such as internal hard disks and removable 
disks; magneto-optical disks; and CD-ROM disks. Any of the foregoing can be supplemented 
by, or incorporated in, ASICs (application-specific integrated circuits). 

Although the present invention has been particularly described with reference to the 

20 preferred embodiments, it should be readily apparent to those of ordinary skill in the art that 

changes and modifications in the form and details may be made without departing from the spirit 



C:\TEMP\BEL-030 Application 07 30 01 FINAL.doc 



CONFIDENTIAL AND PROPRIETARY TO BELIEVE, INC. 
-66- 

Believe Ref No. BEL-030 Atty. Dkt. 069102-0278148 



and scope of the invention. It is intended that the appended claims include such changes and 
modifications. 
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